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On Calculating Analytic Centers 

A. A. Goldstein* 



This note was motivated by papers of Renegar and Shub(88) and by Ye(89). We apply 
Smale’s(86) estimates at one point for Newton’s method to the problem of finding the 
analytic center of a polytope. The method converges globally in the appropriate norm. 
The ideas are then applied to obtain a possible benchmark for path following methods. 

When Smale’s method is tractable its power stems not only from the fact that the in- 
formation is concentrated at one point. There are 2 norms to estimate, not 3 as in the 
Kantorovich estimate. Moreover no estimate of the inverse of the derivative operator by 
itself is needed. The need for the norm of the inverse by itself often makes for coarse 
estimates. 

1. Setting 

Let A denote an m by n orthonormal matrix of rank n and b an m by 1 matrix. 
We assume that m > n. Denote by e a m by 1 matrix whose components are all ones. 
Transposes of matrices will be denoted by an asterisk, rows of a matrix by superscripts, 
and columns by subscripts. The euclidean space of real m-tuples will be denoted by E m . 
If u £ E m we mean by diag(u) the diagonal matrix with entries u lt . The dot product 
corresponding to the usual norm will be denoted by [ , ]. The usual norm will be written 
as || ||2 - E n will be also be renormed under a dot product that will be denoted by < , >. 
The norm arising from this dot product will be written as || ||. Let P be a polytope with 
non-empty interior given by the inequalities 

b — Ax > 0. 

Given xq in the interior of P and t > 0, we seek the analytic center £ of P to within a 
tolerance of e. Let i2,-( x) = bi — A l x . 

Claim 1. Let N be the smallest integer exceeding 

log 2 (4.95m 5 maxRi(xo)) + log 2 

Then if N steps of the Newton sequence are generated using the gradient of the potential 
function below 
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I|*N - f||2 < € 



The proof of the claim depends on the following ingredients. 

2. Some ingredients. 

The potential of P is defined by the expression 

m 

7 r(P) = max Ri(x) : x E P} 

1 = 1 



The maximum is achieved at an unique point called the analytic center of P. (Ye 87 ). 
We shall find this point by seeking a zero for the gradient of the logarithmic potential 

m 

<f>( x ) = X>sW*) 

z = l 



Let D(x) = diag(l/Ri(x)), thus D t i(x ) = l/i? t (x). 

We apply Newton’s method to the gradient of <j> which we denote by F. F(x) may be 
represented by the matrix A* D(x)e, and F(x) € E n . The kth Frechet differential of F at 
x can be identified with a multi-linear mapping from (if/v)* to E n . A representation of 
these differentials as matrices follows. 

F , (x)h 1 = —A*D 2 (x) diag(Ah\)e = -A* D(x)D(x)Ah 1 

F"(x,h\,h,2) = — 2 \A* D(x)D 2 (x) diag^Ah?) Ah} 



F"'(x, hi , /i2, /13) = — 3 \A* D(x)D 3 (x) diag(Ahz) diag(A}i2) Ah\ 

and 



F^ k \x,hi,h2,...,hk) = — fc\A* D(x)D k (x) diag(Ahk), . diag^Ah^) Ah\ 



= -k\A*D(x)Q(x,hi,...,h k ) 



Here 



IIQMIk = 5tip{||(3(.-r./ii,...,fc,.)|| 2 : ||/„|| 2 = ||fc 2 || 2 , 



llfcilh = 1} < 1 
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Theorem 1. (Smale 80 ) Assume F is an analytic map between real Banach spaces X and 

Y, that is the Frechet derivatives F^ k \x) exist for all x 6 X and k=l,2,3, Given x 0 € 

X, assume that the inverse of F'(x ) which we denote by F'_ 1 (x) exists. Set 



P(xo) = ||F’ii(xo)F(xo)|| and 



If 



7(*o) = «<p{ll 4fl 1 (x„)f’<‘>(*„)||l±T : k > 2| 

/3(xo)y(xp) < .130707 



then Xq is an approximate root of F. That is, the Newton sequence 



Xjt+i = Xk ~ F'_ 1 (x k )F(xk ) 



is well defined and {x/;} converges to say a root of F at the rate: 



Ikfc+i -sjtll < 2(^) 2 0(xo) 

Moreover 

II**- <11 < (A) 



3. Proof of Claim 1. 

Assume Xq is given in P(M). The matrix 

P(x 0 ) = D(x 0 )A(A* D(x 0 )D(x 0 )A)~ 1 A* D(xo) 

maps each point in E m to its closest point in the range of the matrix D(xq)A . Hence 
||P(x 0 )|| 2 = 1. We renorm E„ by 

1 1 x || = \\D c Ax\\ 2 

Here Dc = C-D(x 0 ) with C = With this definition we get: 

/?(x 0 ) = C||P(x 0 )e|| 2 = m*/8^ 

Also 

7(xo) < C sup (||P(x 0 )||2 _1 ) sup (\\Q k Ahi || 2 _ 1 ) < C 

Thus fl(xo)i{xo) < - < .130707 

8 
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Hence by Smale’s theorem the sequence generated by the Newton algorithm converges to 
the analytical center £ with a rate given by (A) in Theorem 3.1 above. 

Since < x, x >= [ DqAx , D c Ax ] > C 2 \\x\\l/mnxRi(x 0 ) 2 , then 

IMI2 < CRi{x 0 )||*|| 

Now choose N so that 

C R,(x 0 ) ||xyv — £|| < e 

4. Application to programming 

By a theorem of Ye (89), if one of the hyperplanes of P is translated to pass thru £ then 
the resulting polytope satisfies 

< 1 

7 t(P) ~ e 



Consider the following algorithm for linear inequalities. We wish to solve the system 
b — Ax > 0 if this is possible. Given an arbitrary xq choose M so that b + M — Ax > 0. 
Find the center of this polytope P(M). Take the smallest component of i?(£), say R g ( £). 
Begin anew with the polytope P(M — R q {£)). This algorithm has a worst case iteration 
count of O(m) times our cost of getting to the center. 

For linear programming let the polytope P be given by b - Ax > 0 and P(M) the polvtope 
define by the inequalities for P together with the inequality M - [c , x] > 0. We seek the 
smallest M for which P(M) is non-empty. We first find the center £ of the polytope P. 
We then find the intersection of the ray {x = £ — tc : t > 0} with P Translate the cost 
hyperplane to pass thru this point. Then find the center of the new poly tope P(M). 

5. Benchmark 



We now consider the possibility of starting from a point in a poly tope P(M) and moving 
to the center of a neighboring polytope P(M — 1/2 y/rn) by Newton steps. 



Assume that at (xq,A/ 0 ), R t (x,M) 



that 



d4>(x. M) 

d. Tj 

d<p(x,M) 

DM 



b , + M 0 — A 1 x > 0. We seek a point (aq , M \ ) such 



0, 1 5: j 5: n ( 1«) 



dd>(*uMi) 

DM 



( 16 ) 
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and such that 



Riibi + Mt - A' Xl ) > 0 (2) 

Let Mi = Mo — 1/2 y/rn. Assume that the value of x\ is well defined and given. Otherwise 
P(M \ ) is empty and Mo is within 1/2 y/m of M* the optimal value of M. We show that 
(xo,Mo) is an ’’approximate root” for system (I). 

Remark The matrix (A e) has rank n+1. 

Proof Because of our boundedness assumption on the polytopes, the system of inequalities 
Ax > 0 is inconsistent. If u is in the null space of (A e) then Au = -u n +i« 7 ^ 0, a 

contradiction. 

In matrix notation the system (1) (after scaling the second entry) is 



F(x,M)= ( —A*D(x, M)e — \=e\D{x,M)e - D(x u Mi)e)\ 

\ 2yjm J 



(i) 



Thus we see that - F\x^M) may be generated from the matrix 



B = (Aj, A 2 ,..., A n , A n+1 ) where A n+1 = 

Assume that (Aj, A 2 , ..., A n ) is rescaled if necessary so that ||B|| < 1. By the Remark we 
see that B has rank n+1. Thus Claim 1 holds for this case as well. If we are satisfied with 
a reduction of 1/3 yfm this will happen in N steps by the claim with e set to 1/6-^/m. We 
have then the following result: (not an algorithm but a benchmark) 

Claim 2. We are given a point (xjt,Mjt),. Let Mjt+i — M* — 1/2 y/m. If P(Mjt+i) is 
not empty, take xjt + i for its center. Let the system (I) be run with Newtons’ method. 
Otherwise, stop. In N steps M* will be reduced by at least 1/3 y/in. This value updates 
M*+i and the corresponding iterate for x updates Xk+i- Assume the optimal M say M* 
known. Then the global Newton process can be terminated in no more than Q steps, where 

Q > 3y/m(M 0 - M*)(l + log 2 
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At termination A//v is within 1/2 \/m of M* and x/v is an approximate root for system (I) 
with M* replacing M\ and £ replacing xj, respectively. 

A similar result holds for linear programming. 
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